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Abstract 

We study a multiple-input multiple-output (MIMO) multiple access channel (MAC) from several 
multi-antenna transmitters to a multi-antenna receiver. The fading channels between the transmitters and 
the receiver are modeled by random matrices, composed of independent column vectors with zero mean 
and different covariance matrices. Each transmitter is assumed to send multiple data streams with a 
random precoding matrix extracted from a Haar-distributed matrix. For this general channel model, we 
derive deterministic approximations of the normalized mutual information, the normalized sum-rate with 
minimum-mean-square-error (MMSE) detection and the signal-to-interference-plus-noise-ratio (SINR) of 
the MMSE decoder, which become arbitrarily tight as all system parameters grow infinitely large at the 
same speed. In addition, we derive the asymptotically optimal power allocation under individual or sum- 
power constraints. Our results allow us to tackle the problem of optimal stream control in interference 
channels which would be intractable in any finite setting. Numerical results corroborate our analysis 
and verify its accuracy for realistic system dimensions. Moreover, the techniques applied in this paper 
constitute a novel contribution to the field of large random matrix theory and could be used to study 
even more involved channel models. 
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I. Introduction 

The gains of having multiple antennas at the transmitter and receiver in wireless fading point-to- 
point channels are well established 0]], @. It is known since the Telatar's seminal paper [2] that when 
channel state information (CSI) is available at the transmitter, the optimal transmission strategy is to 
send independent data streams along the eigenmodes of the channel and to allocate power over these 
eigenmodes according to the water-filling principle |3]. If no CSI is available at the transmitter but 
the statistical properties of the channel are known, an optimal static power allocation which does not 
depend on the actual channel realizations and maximizes the ergodic mutual information can be found. 
Uniform power allocation is optimal when the channel entries are independent and identically distributed 
(i.i.d.) Gaussian Q or zero-mean symmetric H. It was shown in [5] that the optimality of uniform 
power allocation also extends to the multiple-access channel. When not ergodic but outage capacity is 
considered, it was conjectured |2] that allocating equal power to only a subset of the available transmit 
antennas is optimal. This conjecture was proved for the Gaussian multiple-input single-output (MISO) 
channel in 0. 

In the presence of co-channel interference, much less is known about the optimal transmission strategies. 
Recently, an exact expression of the ergodic mutual information of a Rayleigh fading multiple-input 
multiple-output (MIMO) system in the presence of different MIMO interferers with arbitrary transmit 
power levels was derived [7j. Since more involved channel models are intractable by exact analysis, 
many works resort to asymptotic analyses where each transmitter and receiver is equipped with a large 
number of antennas. The authors of JH consider a doubly-correlated fading MIMO channel with correlated 
interference and derive asymptotically tight approximations of the mutual information and its variance. 
In Q, the asymptotic mutual information and its fluctuations are studied for arbitrary fading channels 
with a variance profile and correlated noise. However, even in the asymptotic setting, the optimal transmit 
strategies in interference channels are in general unknown. 

An important question in MIMO systems with co-channel interference is whether a transmitter should 
use all of its antennas to transmit independent data streams or whether it should restrict itself to a 
smaller number of streams or antennas in order to reduce the interference to other receivers. In general, 
this problem does not have a simple solution and the optimal number of antennas to be used (or streams 
to be sent) depends on the strength of the interference and, thus, on the cross-channel gains between 
the interferers. The authors of (TO] pioneered this question, assuming that no CSI is available at the 
transmitters while full CSI is available at the receivers. Their main finding is that when the interference 
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is weak, a transmitter should send independent streams with equal power from each of its antennas, 
but when the interference is strong, all power should be put into a single stream which is transmitted 
by a single antenna. In ifTTTl . it was shown that optimizing the number of transmitted data streams is 
also helpful when CSI is available at the transmitter. Several later works lfT2l . |[T3l . fffl have studied 
the same problem in the context of dense random ad hoc networks under different assumptions about 
the availability of CSI at the transmitters and receivers and the corresponding transmit and reception 
strategies. Surprisingly, the conclusions in all of these works are similar, confirming the optimality of 
single-stream transmissions in dense, interference-limited networks. 

The aforementioned references share the underlying assumption that the channel matrices/vectors are 
composed of i.i.d. elements without any form of correlation. Thus, the problem of how many antennas 
should be used for transmission and how many independent data streams should be sent are the same. With 
transmit antenna correlation, however, it makes a difference which antennas are selected for transmission 
and the question of the optimal number of antennas to be used becomes a combinatorial problem. To 
circumvent this issue, random isometric precoding can be used to mitigate transmit correlation. The 
remaining question is then how many orthogonal streams should be sent, using all available antennas. 
This is the primal motivation of this paper, as our results allow to study the sum-rate of systems composed 
of multiple transmitter-receiver pairs, each applying random isotropic beamforming. Random isotropic 
beamforming [15] is a well studied technique in multi-user MIMO communication systems and unitary 
precoders [16 ] are now proposed as limited feedback beamforming solutions in future wireless standards 
ifTTl . lTT8l . Nevertheless, only few related works relying on tools from large random matrix theory have 
been published until today (e.g. [19]) and this paper might stimulate further research in this area. 

The contributions of this paper are as follows. We consider a MIMO multiple access channel (MAC) 
from several multi-antenna transmitters to a multi-antenna receiver. The transmitters are unaware of the 
channel realizations and send an arbitrary number of independent data streams using isometric random 
beamforming vectors. The receiver is assumed to be aware of all instantaneous channel realizations 
and beamforming vectors. We assume a very general channel model where the channel matrices are 
composed of independent, zero mean column vectors, each with a possibly different covariance matrix. 
This channel model allows to treat many classes of well-known channel models, such as matrices with 
a variance profile GUI as well as the Kronecker model Ell . Under these general assumptions, we 
derive deterministic approximations of the normalized mutual information, the normalized sum-rate with 
minimum-mean-square-error (MMSE) detection and the signal-to-interference-plus-noise-ratio (SINR) of 
the MMSE decoder, which become arbitrarily tight as all system parameters grow infinitely large at 
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the same speed. The expressions are given as functions of a set of deterministic quantities which can 
be computed by a standard fixed-point algorithm which is proved to converge. Moreover, we derive 
the optimal power allocation under individual or sum-power constraints which can be computed by an 
iterative water-filling algorithm. We then apply these results to find the optimal number of independent 
streams to be transmitted in a 2 x 2 interference channel. Although the use of deterministic approximations 
in this context requires an exhaustive search over all possible stream-configurations, it is computationally 
much less expensive than Monte Carlo simulations. Extensions to more than two transmit-receive pairs 
and possible different objective functions, e.g. weighted sum-rate or sum-rate with MMSE decoding, are 
straightforward. Our numerical results show that the deterministic approximations are very tight for even 
small system dimensions. We further show that with random beamforming, it is optimal to (i) send as 
many independent data streams as transmit antennas and (ii) allocate power uniformly over the transmitted 
streams. For the interference channel, we find that at low SNR, it is optimal to use all streams while at 
high SNR, stream-control, i.e., transmitting less than the maximal number of streams, is beneficial. Apart 
from these practical applications, our work also constitutes a novel contribution to the field of random 



matrix theory as will be highlighted in Section III 



The remainder of this paper is organized as follows: In Section |ITJ we present a detailed description of 

summarizes recent 



the system model and make several definitions of frequently used quantities. Section III 



results on large random matrix theory involving Haar distributed matrices, which will be extended in this 



paper. We present our main results in Section IV and show numerical results for practical applications 



such as optimal stream control for the MIMO interference channel in Section [V| The paper is concluded 



in Section VI Related results, lemmas and the proofs of all theorems are provided in the appendix. 



Notations: Boldface lower and upper case symbols represent vectors and matrices, respectively. Ijy 
is the size-iV identity matrix and diag(xi, . . . , xn) is a diagonal matrix with elements X{. The trace, 
transpose and Hermitian transpose operators are denoted by tr(-), (-) T and (-) H , respectively. The spectral 
norm of a matrix A is denoted by ||A||, and, for two matrices A and B, the notation A >- B means 
that A — B is positive-definite. The notations and denote weak and almost sure convergence, 
respectively. We use CM (m, R) to denote the circular symmetric complex Gaussian distribution with 
mean m and covariance matrix R. We denote by M.+ the set [0, oo) and by C + the set {z G C, lm[z] > 0}. 
Denote by C the set of continuous functions from X C C to Y C C and by S the class of functions 
/ analytic over C \ M.+, such that, for z G C+, / € C+, zf 6 C + and lim 2/ _ +00 —iyf(iy) < oo. Such 
functions are known to be Stieltjes transforms of finite measures supported by R + (see e.g. ll22lD . 
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II. System model 



Consider the following discrete-time MIMO channel with output vector y G C 

K 

y = ^H fe W fc p|x fc + n (1) 

k=l 

where, for k G {1, . . . , K}, 

(i) Hfc G <C NxNk is a random channel matrix whose jth column vector hkj € is modeled as 

h kj = RfjUfej, j = 1, . . . , N k (2) 

where G i^NxN ar£ jj errra ti an nonnegative definite matrices and the vectors ujy G have 
independent and identically distributed (i.i.d.) elements with zero mean, variance l/N and finite 
moment of order 4 + e, for some common e > 0, 

(ii) Wfc G <C NkXTlk is a complex precoding matrix which contains < Nj, orthonormal columns of an 
Nk x Nk Haar-distributed random unitary matrix, 

(hi) Pfc = diag(pfci, . . . ,Pkn k ) £ K™ fcX ' lfc is a nonnegative diagonal matrix, 

(iv) Xfc ~ CJ\f(0, I rik ) is the transmit vector of the kth transmitter, 

(v) n ~ CJ\f(0, pIn) is a noise vector. 

Remark 1: The statistical model ([2]) generalizes several well-know channel models of interest (see 
|[23l . ll24l for examples). It comprises in particular the Kronecker channel model with transmit and receive 

correlation matrices |25h ETI . where the matrices are given as 

i i 

Hfc = R^UfcT^ (3) 

where G <C NxNk is a random matrix whose elements are independent CJ\f(0, l/N) and R^ G C NxN 
and Tk G C NkXNk are covariance matrices. Since both and are unitarily invariant, we can assume 
without loss of generality for the statistical properties of y that Tfc = diag(ifci, . . . ,tkN k )- Defining the 
matrices Rjy = t;yRfc for j = 1, . . . , Nk, we fall back to the channel model in Q. 

Remark 2: Whenever the distribution of is invariant by multiplications by unitary matrices from 
the right side (e.g. for ([3]) with = In*), our channel model boils down to 

K 



y = ^H Jfc Pfx fc + n 



k=l 



which has been studied in [23 ] for the general case ([2]) and in Ell for the Kronecker model ([3]). The same 
holds for nk = Nk for all k with uniform power allocation, i.e., = since WfcPfcW}" 1 = I Nk . 
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The next definitions will be of repeated use in the sequel. Let the matrix B^r G C be defined as 

K 

B w = ^H,WAW fc H H^. 

k=l 

We denote by In(p) the normalized mutual information of the channel ([T]), given by 

In(p) = logdet ^Iat + ^Bjyj 

expressed in nats/s. We further denote by 7^ the SINR at the output of the linear MMSE detector for 
the jth component of transmit vector xjt, which reads ll26l 

Ikj = Pkj^kj'H-k ( B N[kj] + pIat)~ H fc w fci 

where Bjv[;y] = B^ — p^jH^kj^kj^-k and Wjy is the jth column of W^. We further define the 
normalized sum-rate with single-stream MMSE detection as 

-| K n k 

fc=i j=i 

The aim of this paper is to derive deterministic approximations of the quantities ijv(p), l^j anc ^ Rn(p) 
which become almost surely arbitrarily tight as the dimensions of all involved matrices grow large. To 
make the definition of growth rigorous we need the following technical assumption: 

For k G {1, . . . ,K}, let {Nf,} = {Nk(N)} and {n^} = {n&(iV)} be sequences of integers with ratios 
Ck = Cfc(iV) = and Ck = Ck{N) = j^. The notation N — > 00 should be understood from now on 
as N, Ni, . . . , Nk, m, . . . , rix —> 00, such that < c& < 1 and < liminfTv c& < limsup^y Ck < 00. 
For all convergence results in this paper (as N — > 00), the matrices P& = P&(iV) G IR™' e>< ' 1 ' i , H^j = 
Rkj(N) G C NxN , H fc = H fe (iV) G C NxNk and W fc = W k (N) G C^** 71 * should be understood as 
families of (random) matrices with growing dimensions. Wherever this is clear from the context, we drop 
the dependence on TV to simplify the notations. 

III. State-of-the-art 

Isotropically precoded systems with linear receivers have been studied in the asymptotic limit in several 
works. Relying on results from free probability theory, the authors of lf27ll investigate the asymptotic 
performance of the MMSE receiver for the channel model ([T}, assuming K = 1, Pi = I ni and Hi 
diagonal with i.i.d. elements. Extensions of this work to frequency- selective fading channels with sub- 
optimal receivers were considered in |28l . Multi-carrier code-division multiple-access (MC-CDMA) with 
random i.i.d. and isometric spreading sequences over Rayleigh fading channels, i.e., K > 1 and 
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diagonal with i.i.d. complex Gaussian entries, was studied in [29] and approximate solutions of the 
SINR of the MMSE receiver were derived. In |30], an expression of the asymptotic spectral efficiency 
for the same model was presented. In a later work 11311 . DS-CDMA over flat-fading channels was 
considered, i.e., K > 1, n k = N and = I/v for all k. The authors derive in particular deterministic 
approximations of the Shannon- and 77-transform, exploiting the asymptotic freeness C21 Section 3.5] of 
the matrices W^P^W^ 1 . They further present a sum-rate maximizing power-allocation algorithm. The 
exact asymptotic SINR of the MMSE receiver for the general channel model ([I]) with deterministic, 
jointly diagonalizable matrices H^] was found in |[32l via incremental matrix expansions. 

In lfT9l . the channel model ([T]) with arbitrary deterministic matrices was considered and deterministic 
approximations of the Stieltjes transform, the normalized mutual information and the SINR of the MMSE 
receiver, which are asymptotically almost surely tight, were established. As the results of the current paper 
build heavily on their work, we will restate the main theorems from |[T9"1 . The aim of this paper is to 
extend these results to the case where the matrices are random and modeled according to Q. 

The first theorem in |[T9l introduces a set of 2K implicit equations whose unique solution defines 
some quantities e±, . . . , Br , e\, . . . , &k- It will turn out that the normalized mutual information as well 
as the SINR of the MMSE receiver can be expressed as functions of these quantities. Also a fixed- 
point algorithm for the computation of ei, . . . , ex, ei, . . . , &k is provided in |fT9l , which is guaranteed 
to converge to the correct solution. 

Theorem I ( Kl9\ Theorem I]): For k G {1, . . . , K}, let P fc G M™ fcXnfc be a diagonal matrix and let 
Hfe G C NxNk . Define = and = j^-. Then, the following system of implicit equations in 

(e k (z), . . .,e K (z)): 

ek{z) = "^ trP fc {ek(z)~Pk + [ck ~ e/ c (z)eA : (2;)]I„ fc ) _1 

1 ( K 
e k (z) = -trH fc H fe H I X>(z)H,H? - zl N 

has a unique solution {e\(z), . . . , Ek{z)) G C k satisfying {e\(z), . . . , ex{z)) G S K and, for z < and 
for all k, < e k (z) < c k c k /e k (z). 

The next theorem utilizes the quantities provided by Theorem [T] to establish a deterministic equivalent 
to the normalized mutual information assuming that the matrices are random. However, no particular 

'The matrices Hi, . . . ,Hk are jointly diagonalizable if there exists a unitary matrix V such that VHfcH^V H is diagonal 
for all k. 
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random matrix model is specified and the only condition is the almost surely bounded spectral radius of 
the matrices H^H^. 

Theorem 2 (HJ9\ Theorem 3]): For k G {1,...,K}, let ~P k G R^ xn * be a diagonal matrix with 
spectral norm bounded uniformly along n k and Wfc G C*^"' be n k < N k columns of a unitary Haar 

ha" 

N k °* — AT 

B ^v = Ef=i HfcWfcPfeW^H^. Further, let p > and define l N (p) = ^ logdet(Ijv + ±B N ). Then, 



distributed random matrix. Consider H k G <C NxNk a random matrix such that = H^H^ 1 G u~' NxN 
has uniformly bounded spectral norm along N, almost surely. Define c k = and c k = ^f- and denote 



X N {p)-X N {p) 

JV— >oo 



where 



1 / 1 X _ \ 

— log det I Ijv + - ^ e k K k J 

V ^ fc=i / 



N 

K r 



— log det ([c fc - e fc e fc ]I nfc + e fc P fc ) + (1 - c fc )c fc log(c fc - e k e k ) - c k \og(c k ) 



+E 

fc=i 

and where e k = e k (—p), e k = e k (—p) for all k are given by Theorem [I] 



In |[T9l Theorem 4], also a deterministic approximation of the SINR at the output of the MMSE receiver 
is provided. Again, this theorem builds upon the solutions to the fundamental equations in Theorem [T] 
and will not be stated here for brevity. In the next section, we will derive analogous results to the above 
theorems under the assumption that are random and modeled by Q. 
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IV. Main Results 

Our first result is a generalization of Theorem [T] to the channel model Q. The quantites Qkip), 9k{p) 
and 5kj(p), which are defined in the next theorem as the unique solution to a set of implicit equations, 
can be seen as the counterparts of e k and e k in Theorem [T] They will be similarly used to provide 
deterministic approximations of the mutual information and of the SINR of the MMSE detector. All 
proofs can be found in Appendix |A| 

Theorem 3 (Fundamental equations): For k G {1, . . . , K}, let P k G j£™<« xn <= be a diagonal matrix and, 
for j G {1, . . . , N k }, let Hkj G t£ NxN be a nonnegative-definite Hermitian matrix. Let p > and define 

3^ ana c fc - IV 



c k = jr an d Cfc = Then, the following system of implicit equations: 



1 / _ \ -1 

9k{p) = -jjK¥k{gk{p)Vk + [cfe - 9k(P)9k(p)] In, 





has a unique solution satisfying 5 k j(p) > 0, <?fc(/o) > and < gfc(p) < c k c k /g k (p) for all fc, j. Moreover, 
this solution is given explicitly by the following fixed-point algorithm: 

9k(p)= Hp (p) , 5fe (p) = I™ g k (p) , <5fcj (p) = .Hp 4? (p) 



where 



(p) = lim (p) , *g (p) = lim 4f (p) 



I— >oo J I— >oo J 

1 N " /) {t) (n\ 



1 



#°(p) = >P fc (r i} (p)P fc + [<* - gt%)st-%) 



Hi.- 



K N k -(*-!), x ' 



1 R [1 yy 9k (P) R fc,J | nT 



kj 

with the initial values 5%f\p) = 1/p, G [0, c k c k / g^~ l \p)) , g k = and g^ = for all k,j. 

Before we present our results on the normalized mutual information and on the SINR of the MMSE 
receiver, we need the following two assumptions about the covariance matrices H k j and the power 
allocation matrices P k : 
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A 1: Rfcj S TZn for all k,j, where TZn = {R m ,m = 1,...,M} is a family of M Hermitian 
nonnegative-definite N x N matrices, satisfying maxmllimsup^HRmH} < R < oo. 

A 2: For all k, limsup^HPfcll < P < oo. 



Remark 3: While Assumption A [I] is sufficient to ensure that the matrices H^H^ 1 have almost 
surely bounded spectral norm 1241 Proof of Theorem 3], Assumption A [2] is necessary to ensure that no 
transmitter allocates an increasing amount of power to any of the streams as N — > oo. 

The next theorem extends Theorem[2]to random matrices H k (as given by ([2])) and provides additionally 
a deterministic approximation of the normalized ergodic mutual information. 

Theorem 4 (Mutual information): Assume that Assumptions A[T]and A[2]hold true. Let p > and let 
9k = 9k(p)> 9k = 9k(p) and 5 k j = S k j(p) for all k, j be defined as in Theorem [5] Then, 



(i) I N {p) - I N (p) 0, (u) EI N (p) - I N (p) >0 

N->oo N— >oo 

where 

^ K K 

T N (p) = V N (p) + — logdet ([c fe - g k g k ] I nfc + g k P k ) + / - c k )c k \og(c k - g k g k ) - c k log(c fc ) 

k=l k=l 



(4) 

Remark 4: One can also consider an equivalent model where the matrices are extended to 
N k x iVfc matrices by adding Nk — n k zeros to their main diagonal. This leads to c k = 1 for all k and 
the expression of In(p) in Q can be simplified accordingly. 

Similarly, we obtain an extension to |[T9l Theorem 4] for the asymptotic SINR at the output of the 
MMSE receiver based on the fundamental equations in Theorem [3] 

Theorem 5 (SINR of the MMSE detector): Assume that Assumptions A[T]and A|2]hold true. Let p > 
and define g k = g k (p) and g k = g k (p), as given by Theorem [3] Then, 



where 



Tk j ~ Tk j ~ > 



-N _ Pkj9k 

Ikj ~ 



Ck — 9k9k 
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The next corollary from Theorem [5] provides an asymptotically tight approximation of the normalized 
(ergodic) sum-rate with single-stream MMSE detection: 

Corollary 1: Assume that Assumptions A [I] and Aj^jhold true and let 7^ be as defined in Theorem |5j 
Then, 



(i) R N (p) -R N (p) -^0, (u) ER N (p)-R N (p) >0 

JV— s>oo A 7 — >oo 



where 



fc=i j=i 

Our last result is the asymptotically optimal power allocation which maximizes the normalized ergodic 
mutual information under individual and sum-power constraints: 



Proposition 1 (Optimal power allocation): Let p > and I(p) be defined as in Theorem |4| and let 
P,P\,..., Pk > 0. Then, the solution to the following optimization problem: 

(Pl,...,P* K ) = arg max I N (x) 



s.t. 



n ^trP fc < P k ~ik (I) 
Ef =1 ^trP fe <P (//) 



is given by 



Pkj 



Pk (I) 

for all k,j, where P£ = diag(pjy, j = 1, . . . ,nx), 9%,9% are given by Theorem [3] for P& = P£, and A 
in (II) is chosen such that J2k=i ^T tr ^ = P - Moreover, let 

(P;,...,Pk) = arg max EI N (x) 

f^-trP fc <P fc VA: (I) 
s.t. { nk (5) 

lEf=i^trP fc <P (//) 
and assume that Assumptions A [T] and A [2] hold true, then, 

EI N (Pl ...,P* K )- I N (Pl, ...,P* K ) 0. 

Af— 5>oo 
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Remark 5: The optimal power allocation matrices Pf under a sum-power constraint (II) can be 
computed by the iterative water-filling algorithm below. Although we cannot prove the sure convergence 
of this algorithm (see |[2T1 Remark 2] and [33] for a related discussion), we know that if it converges, it 
achieves the correct solution. In our simulations, we could not create a case in which it did not converge. 



Algorithm 1 Iterative water-filling algorithm 

1: Let e > 0, t = and p$ = P k for all k,j. 

2: repeat 

3: For all k, compute g^ and gjj® according to Theorem j^j for the matrices = diag (pkfj- 

4: For all k, j, calculate = (gf - p + sf^j + , wit h A such that £f=i i Y%=i Pk^ = P - 

5: t = t + l 

6: until maxfcj |pj^. -p k j\ < e 



Remark 6: The optimal power allocation also shows that sending as many independent data streams 
as transmit antennas is optimal to maximize the ergodic mutual information. This might not be the case 
in interference channels as will be discussed later on. 

Remark 7: For the special case K=\ , Pi = I n± , N\ = n\ = N and Rij = Ijv for all j, the set of 
implicit equations in Theorem [3] reduces to: 



^-g(p)g{p)+g(pY K ' 1 + K ' i + g$ s{p) +P 

Note that 

= 1 " t" n-nt n = i - [i - 3(p) - g{p)M = [i - (i - 3(p)) 

i - g{p)g{p) + 9{p) 

which implies = 1 since 1 — g(p)g(p) > by definition. Thus, the last equations further simplify to 

M = M^, ha - 1 



which has a unique solution satisfying 5(p) > and that can be given in closed-form: 
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WiP^Xi 



W 2 P|x 2 



W 3 P|x 3 




Fig. 1. MIMO MAC from three transmitters (k — 1,2,3) with Nt antennas to a receiver with N antennas. Each transmitter 
sends streams with precoding matrix Wfc and power allocation over the channel y / OfcHfc. 



Notice that 5(p) is the Stieltjes transform of the Marcenko-Pastur law |[22l Eq. (2.20)] evaluated on the 
negative real axis. This result is consistent with our expectations since B = UU H , where U G i£ NxN has 
i.i.d. entries with zero mean and variance 1/N. Moreover, the expression of the normalized asymptotic 
mutual information as given in Theorem [4] reduces to 

In(p) = V N (p) = log (1 + 5(p) + l/p) - -M^. 

1 + S(p) 

which is consistent with the asymptotic spectral efficiency of a Rayleigh-fading N x N MIMO channel 
041 Eq. (9)] (see also |E2 Section 12.2.2]). Equivalently, the asymptotic SINR of the MMSE detector 
and the associated normalized sum-rate can be given as (cf. ||34l Proposition VI. 1]): 

7f = S(p), R N (p) =log(l + 5(p)). 

In the following section, we will present some applications of the previously derived results in the 
context of multiple-access and interference channels. 

V. Numerical Examples 

A. Multiple Access Channel 

We consider a MAC from three transmitters to a single receiver as shown in Fig. [1] The channel from 
each transmitter is modeled by the Kronecker model (see Remark [T} with individual transmit and receive 
covariance matrices and and we assume additionally a different path loss > on each link. 
The received signal vector y for this model reads 

3 _ i 
k=l 
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TABLE I 

Simulation parameters for Fig. [2] N = 10, = 8A(i — j) 



k 


iV fe 


rik 


0t,fc 

mm 


nt,k 
"max 


min 


nr,k 
Pmax 






1 


10 


8 





tt/2 


-tt/4 





4A( ? - j) 


1 


2 


5 


4 


-tt/4 


tt/4 





vr/3 


4A(»-j) 


1/2 


3 


5 


4 


-tt/2 





-7I-/3 


tt/3 


4A( ? - j) 


1/2 



We create the correlation matrices according to a generalization of Jake's model, as recently introduced 
in EH . where the elements of T& and are given as 

fmax - % in \ / 

Cmax — C m i n ^fmin V / 

where (fl^ in , 6?max) and (0^ n , #max) determine the solid angles over which useful signal power for the kth 
transmitter is radiated or received, d 1 ^ and d\^ are the distances between the antenna elements i and j at 
the A;th transmitter and receiver, respectively, and A is the signal wavelength. We assume uniform power 
allocation, i.e., = l Uk for all k, and define the signal-to-noise ratio SNR = 1/ p. The explicit choices 
of all other parameters are summarized in Table [I] 

Fig. [2] compares the normalized mutual information In{p) and the normalized rate with MMSE 
decoding R^(p), averaged over 10,000 different realizations of the matrices and W^, against their 
deterministic approximations In(p) and Rn(p)- Although we have chosen small dimensions for all 
matrices (see Table [i]), the match between both results is almost perfect. Also the fluctuations of In(p) 
and Rn{p) are rather small as can be seen from the error bars representing one standard deviation in each 
direction. The figure further illustrates the gains of optimal power allocation with a sum-power constraint 
(II), where we have chosen P = ^fe=i ^^m, = 3 to achieve the same total transmitted power as for 
uniform power allocation, i.e., = I Uk . 
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-10 -5 5 10 15 20 

SNR [dB] 



Fig. 2. Comparison of the average normalized mutual information In{p) and the normalized rate with MMSE decoding Rm{p) 
with their deterministic approximations In(p) and R^{p). Error bars represent one standard deviation in each direction. 



ri\ streams 



ri2 streams 




Fig. 3. Interference channel from two transmitters with Nk (k = 1, 2) antennas, respectively, to two receivers with N antennas 
each. Each transmitter sends independent data streams to its respective receiver. 



B. Stream-control in interference channels 

Consider a MIMO interference channel consisting of two transmitter-receiver pairs as depicted in Fig. [3] 
The received signal vectors yi,y2 € are respectively given as 

yi = HnWiXi + Hi 2 W 2 x 2 + ni 
y 2 = H 2i W]Xi + H 22 W 2 x 2 + n 2 
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where H qk G C NxNk , W k G C NkXNk , x fe ~ CM(0,P k ), P k G R^ kXNk satisfying ^trP fc = 1, and 
n k ~ CM(0, pIn), for q, k G {1, 2}. Assuming that the receivers are aware of both precoding matrices and 
their respective channels but treat the interfering transmission as noise, the normalized mutual information 
between xi and yi, and X2 and y 2 is respectively given as 

h(p) = ^logdet (l N + ^H u ,W fc P fe W, H H^ - ilogdet (l N + *H 12 W 2 P 2 W 2 H H^ 
Hp) = 4 log det ( Ijv + - V U 2k W k P k W^ k ) - 4 logdet fljv + -HaiWiPiWfH^ 



We adopt the same channel model as in Section V-A where the channel matrices Hi k are given as 



where U qk G <C NxNk have independent CAf(0, 1/N) entries and T k and H qk are calculated according 
to ([6]). We assume that no channel state information is available at the transmitters, so that the matrices 
P k are simply used to determine the number of independently transmitted streams: 

N k 




We will now apply the previously derived results to find the optimal number of streams {n^n^) maxi- 
mizing the normalized ergodic sum-rate of the interference channel above. That is, we seek to find 

{n\,n* 2 ) = maxE[Ji(p)+J 2 (p)] 

ri!,n 2 

s.t. 1 < ni < Nx, 1 < n 2 < N 2 

where the expectation is with respect to both channel and precoding matrices. Due to the complexity of 
the random matrix model, this optimization problem appears intractable by exact analysis. At the same 
time, any solution based on an exhaustive search in combination with Monte Carlo simulations becomes 
quickly prohibitive for large N\,N 2 , since N\ x N 2 possible combinations need to be tested. Relying on 
Theorem[4j we can calculate an approximation of E [h(p) + hip)] to find an approximate solution which 
becomes asymptotically exact as 7Vi and N 2 grow large. Thus, we determine {n\,n 2 ) as the solution to 

(nl,nl) = max h(p) + I 2 (p) 

n!,n 2 

s.t. 1 < m < Ni, 1 < n 2 < N 2 

where h(p), h(p) are calculated based on a direct application of Theorem [4] to each of the two log-det 
terms in h(p) and hip), respectively. The optimal values {n* 1: n^) are then found by an exhaustive search 
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TABLE II 

Simulation parameters for Fig.wUnd[5| N = 10, e£* = 4A(i — j), d\f = 4A(i — j) 



(g.fc) 


N k 


Qtk 

mm 


nt,k 
"max 


miri 


nr,q t k 
"max 


(i.i) 


10 





vr/2 


-7T/4 





(1,2) 


10 


-tt/2 








tt/4 


(2,1) 


10 





tt/2 


-7I-/3 





(2,2) 


10 


-7T/2 








vr/3 



over all possible combinations. Although we still need to compute JVi x N2 values, this is computationally 
much cheaper than Monte Carlo simulations. 

Fig. |4] and [5] show the average normalized sum-rate E [h(p) + h{p)] and the deterministic approx- 
imation by Theorem [4] /1 (p) + hip) as a function of (ni,ri2) for the simulation parameters as given 
in Table |n] We have assumed SNR = dB and SNR = 40 dB in Fig. |4] and |5j respectively. In both 
figures, the solid grid represents simulation results and the markers the deterministic approximations. 
Surprisingly, we observe an almost perfect overlap between both results for all values of (ni,n2). The 
optimal values {n\,n\) and (nf,^) coincide for both values of SNR and are indicated by large crosses 
in both figures. At low SNR, both transmitters should send as many independent streams as transmit 
antennas, i.e., n\ = ri2 = 10. At high SNR, one transmitter should use only a single stream (m = 1) and 
the other transmitter ri2 = N — 1 = 9 streams. These results are in line with the observations of ifTOll . 

Obviously, the last result is highly unfair and better solutions can be achieved by using different 
objective functions, such as weighted sum-rate maximization. Also optimal stream-control with MMSE 
decoding could be carried out in a similar manner. Although we would still need to perform and exhaustive 
search over all possible combinations of n\,n2, the computations are significantly faster than simulation- 
based approaches. The development of more intelligent algorithms to determine (n\,n\) is outside the 
scope of this paper and left to future work. The extensions to more than two transmitter-receiver pairs 
are straightforward. 



18 




Fig. 4. Sum-rate versus number of transmitted data-streams (ni, n^) for SNR = OdB and all other parameters as provided in 
Table |n| Solid lines correspond to simulation results, markers to the deterministic approximation by Theorem [4] As expected, 
both transmitters should send the maximum number of independent streams. 




Fig. 5. Sum-rate versus number of transmitted data-streams (m, riv) for SNR = 40 dB and all other parameters as provided in 
Table |TT] Solid lines correspond to simulation results, markers to the deterministic approximation by Theorem [4] As co-channel 
interference is dominant there is a clear gain of limiting the number of transmitted streams. 
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VI. Conclusion 

In this correspondence, we have studied a general channel model for randomly isometric precoded 
MIMO systems over correlated fading channels which finds useful applications in the context of multiple 
access and interference channels. For the MAC, we have derived deterministic approximations of the 
normalized (ergodic) mutual information, the (ergodic) sum-rate with single-stream MMSE decoding as 
well as the SINR of the MMSE receiver, which are almost surely asymptotically tight. Moreover, we 
have provided an asymptotically optimal power allocation algorithm under individual and sum-power 
constraints. Our results were then used for optimal stream control in interference channels. Numerical 
results show that the asymptotic results provide very tight approximations for systems with realistic 
dimensions. 
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Appendix A 
Proofs 

Proof of Theorem ^(Fundamental equations): It was shown in jl9\ Eq. (27)] that, for any fixed 
9k(p) > 0, the following equation in g~k(p)'- 

9k{p) = jf^h (s'fc(p)Pfc + [ck ~ 9k(p)9k(p)} In») 

has a unique solution, satisfying < g k (p) < CkCk/ 9k{p)- Thus, g~k{p) is uniquely determined by gk(p)- 
Consider now the following functions for k G {1, ... , K} and p > 0: 



1 + 9khj{p) 

where ^ € [0, CkCk/xt) and Skj(p) > are the unique solutions to the following fixed-point equations: 

9k = ^ trP fc ( x fc p fc + [cfc - £fc<?fc] In») (7) 

Similar to lfT9l Proof of Theorem 1, Step 2] it is now sufficient to prove that the K-variate function 
h : (x\, . . . , xk) (^-1, • • • , ^if) is a standard function [35]: 

Definition 1: A function h(xi, . . . , G is standard if it fulfills the following conditions: 

1) Positivity: if xi, . . . , x# > 0, then for all k, h^{x\, . . . , xk) > 0. 

2) Monotonicity: if a?i > x' l5 . . . , xjf > x' K , then for all fe, h^(xi, . . . , xk) > hk(x' x , . . . , x' K ). 

3) Scalability: for all a > 1 and fe, ahk(xi, . . . , x^-) > hk(axi, . . . , ax^). 



This guarantees by IT351 Theorem 2] that a standard fixed-point algorithm that consists of setting 

x^ = hk(x^ \ . . . , x^ ), A: = 1, . . . , K 

for t > and for any set of initial values xf \ . . . , x$ > 0, converges to the unique jointly positive 
solution of the system of K equations 

Xk = h k (xi, . . . ,xk), k = l,...,K. 

Showing positivity is straightforward: For p > 0, we have 5kj(p) > by Theorem [6] in Appendix |b| 
and gfc > by its definition. Thus, hk{x\, . . . , x^) > for all xi, . . . , xk > 0. 
To prove monotonicity of hk{x\, . . . ,xk), we need the following results: 
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Lemma 1 ( j[19\ Eq. (26)]): Let Xk > x' k , and consider g k and g' k the corresponding solutions to ([7]). 
Then, 

(i) 9 k < 9k (ii) x k9k > x' k g' k . 

Lemma 2: Let p > and assume ^ > gjj.. Consider Skj(p) and 5' k -(p) as the unique solutions to ([8) 
for gk and respectively. Then, 

(i) 5 kj {p) < S' kj {p) (ii) g k 6 kj {p) > g k S' kj (p). 

Proof: The proof is based on the consideration of an extended version of the random matrix model 
assumed in Theorem]^] Let us consider the following random matrices G c L7VxLAr <% given as 

1 



\/LN 



( R fci) 2 U£i, • • • , (RfejvJ 2 u L 



where Rjy = diag(R / i c:) , . . . , Rfej) € C LNxLN are block-diagonal matrices consisting of L copies of the 
matrix Rjy and \J k - G ^LNxL are ran( j om ma trices composed of i.i.d. entries with zero mean, unit 
variance and finite moment of order 4 + e, for some e > 0. We define the following matrices which will 
be of repeated use in the sequel: 

B L = f>H£(H£) H , W L = P[ (H£) H + £ ^ H f( H H 



k=l l=l,l^k 

Q = (b l + pljvx) ~\ Q' = (B' L + pI^vL 



One can verify from Theorem [6] that for any fixed N, Ni, . . . , Nk, the following limit holds: 

^trR^+pI^)- 1 ^^)- 

Thus, any properties of the random quantities on the left-hand side of the previous equation also hold for 
the deterministic quantities 5 k j(p)- We will exploit this fact for the termination of the proof. Notice that 
the matrices B L and B /L differ only by g k . This is sufficient since the case g\ > g[ for I G {1, . . . , K} 
follows by simple iteration of the case g\ = g[ for I / k and gk > g' k . 
To proof (i), it is now sufficient to show that, for any L, 

ItrR^- (Q - Q') < 0. 

By Lemma ^ this is equivalent to proving (Q) 1 — (Q') 1 >- 0, which is straightforward since 
(Q)- 1 - (QT 1 = B L - B' L = (g k - g' k )H L k (h£) H y 0. 
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Thus, 



1 



trR^. (Q - Q') 5 kj (p) - 6' kj (p) < 



since 5 k j(p) and 8' k -{p) do not depend on L. 
For (ii), we need to show that 

1 



g k — trR^Q - ^-^trR^Q' > I). 



Similarly to the previous part of the proof, it is sufficient to show that (g k Q) 1 — (fl^Q') 1 -< 0. Hence, 

(gkQ)' 1 - (g'kQT 1 = ^ (b l + pInl) - ^ + pi NL 

9k ^ ' 9 k v 



/' I — - ) Itvl + 



(i-j) E 



-< 

since p > 0, g k > g k and 5; > for all L ■ 

Consider now (xi, . . . , x#) and (x^, . . . , x^-), such that x^ > x' k Mk, and denote by (gi, ... , g^) and 
(g^, . . . the corresponding solutions to Q. Denote by S k j{p) and 5' k j(p) the unique solutions to ((8) 
for Q71, ... , gx) and (g[, . . . , g' K ), respectively. It follows from Lemma [TJ that g k < g' k Vk. Lemma [2] 
now implies that S k j(p) > S k j(p) and g k 5 k j(p) < 9 k ^' k j(p)- Combining these results yields 

v ^ ' v Up) 



h k (x u ...,x K ) = 7 



Skj{p) 



> 



1 %\p, 

V 1 + g'Jl 



f^l + g k 5 kj (p) Nf^l + g' k 5' kj (p) 

which proves monotonicity. 

To proof scalability, let a > 1, and consider the following difference: 



h k (x'i, . . . , x' K ) 



ah k (xi, . . . ,x K ) - h k (axi, 



,ax Kj 



1 a5 k j(p) "fcj 

Nj^l + g k 8 k] {p) 1 + g(«) 6 W {p) 



N 



E 



\a5 kj (p) -5 { k f(p) 


+ h 3 (p)ifiP) 


_(a) _ " 

- 9k 


[1 + 9khj 


ip)\ 




(p)~ 





where we have denoted by g k °^ the solution to ^ with x k replace by ax k and by $ k "\p) the solution 
to ^ for . We have from Lemma [l] (i) that g^ < g k and from Lemma [l] (ii) that 

9k > 0. 



-(«) ^ 
ax fc c/^ > x fc 5fc 



_(a) 



(9) 
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It remains now to show that also a5kj(p) — (p) > 0- To this end, consider the following difference: 



where 



T(p) 



N 
1 



K N k 



A' 



k=l 3 
K N, 

EE 



9k K fcj 



fc=i j=i 1 + 9 k %• IP) 



+ pi-N 



By Lemma [sj it is now sufficient to show that (T^ a \z)^) 1 >- («T(z)) 1 . Write therefore 

- (aT(p))- 1 

, K N k 



T {a \p) 



r _(a) - " 


, -(a)- 
+ 9 k '9k 


ad kj (p) - 5 { k f(p) 


a[l + 6ffc%(p)] 


^9 { k a) 4f(p)_ 





R 



■kj ■ 



The first summand is positive definite since p > and a > 1. All other terms are also positive definite 
since a^ o) - > from Q and agpg k 5 k j(p) > 9k9 k a) ^ k f {p), since > g k and g k 5 kj (p) > 

dk'^kf(p) b v Lemma [^(ii) and Lemma jlj(i). Since the sum of positive definite matrices is also positive 
definite, we have ad k j(p) — (p) > 0. This terminates the proof of scalability. 

Thus, we have shown h : (xi, . . . , xk) (hi, . . . , hx) to be a standard function. Moreover, from 
|fT9l Remark to Theorem 1] and Theorem [6j we have the following algorithm to compute g k and S k j{p): 



where 



_(t) 
9 k 



9k 



1 

N 
1 

N' 



lim g®, 

t-S>00 K 



5 kj (p) = Jim 6%(p) 



t— >oc 



trP fe ( x k P k + \c k - x k g\ 
K N k 



(*-i) 



trR 



l \ -» \ ^ <?fcRfc,j , 

iV fc=ii=i l + 9fe^j (P) 



and 5^ can take any value in [0, c k c k /x k ) and S^'(p) = l/p for all fc, j. 



(0)/ 
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Proof of Theorem 2 (Mutual information): We begin by proving the following result: 



max \e k {p) - g k (p)\ — — ■> (10) 

k N— >oo 

max|e fc (p) -ff fc (p)| ^ > (11) 

fc JV— 5>oo 

where e k (p), efc(p) are defined in Theorem [T] and g k (p), 9k(p) are defined in Theorem [3] assuming that 
the matrices are random and modeled as described in Q. For notational simplicity, we will drop 
from now on the dependence on p. For any given family (/jv,i, • • • , Jn,k)> N = 1, 2, . . . , of bounded 
real numbers,we have from standard lemmas of random matrix theory]^] 

fN,k = — trHfeH^ 1 ( /jv,iHiH^ + pljv J 

l N k / K \- 1 

j=i \i=i / 



A ^ 1 + /jv^hH /Ar,iH;HH - /jv,fch*jhH + P Iat) 1 h kj 



(b) 1 ^ ^ftrR fci /a^H^H" - /jv,feh fc jh^. + pI N 

N 3=1 1 + f N ,kjjtrK kj (y£ =1 f Nji HiH? - f N ,kh kj h^ + plj 



(c) i g ^trR fej (Eti /^H.HH + pi 



1 



iV 



N 3=1 1 + f N ,kjftrR kj /jv.iHiHH + pljy 



(d) 1 A ^trRjyT 



V N J - (12) 

" N pi 1 + /iV,^trR fci T 

where (a) follows from Lemma |4j (b) follows from Lemma [6] and Lemma [7J (c) is due to Lemma [5] and 
(d) follows from a direct application of Theorem [6] where we have defined 

K N k x 



1 fN,kR-kj . 

^^t^l + /iV,^trR fci f / '--' V 



Hence, in particular for f^ ik = e k and /jv^ = e k as defined in Theorem [TJ we can write 
* = ^ H 'H" (E 1111 + - E 1+ ltX-T + W * 

2 Let ajv and fojv denote a pair of infinite sequences of random variables. We write ajv x 6jv, iff ajv — bjv — ^» for JV — > oo. 
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for some sequences of reals cn k, satisfying epj k — — — > 0. 
Recall now the following definitions for k = 1, . . . , K : 



9k 



1 • <,;.rH,, ; T A7 '- 

N k i 



1 ^ Jftt^lkjT 



9k 



= > - r^- , < e fc < CfcCfc/efc 

N ^ c fc - e fc e fc + e fc p fci 

^7$^- zr 1 -. , 0<g k <c k c k /g k 

N ~[ c k~ 9k9k + 9kPkj 



where 



K N, 



T = 



1 \ - \ - efcRfcj 



^V^^l + /iV,^trR fej T 

K N k 



k=l j 

I 



1 \ ^ \ -* 9kR-kj 




A. Case: limsupcfc < 1 

We will first assume that limsupcfc < 1 for all k. The case limsupcfc = 1 will be treated sep- 
arately in the subsequent section. Denote P = maxfc{limsup||Pfc||}, R = max m {limsup||R m ||}, 
c + = maxfcjlimsupcfc} and c_ = min^jliminf dk}, c + = max^jlimsupcfc}. Since we are interested 
in the asymptotic limit N — > oo, we assume from the beginning that N is sufficiently large, so that the 
following inequalities hold for all k: 

Ck < c+, c_ < c k < c+, ||Pfc|| < P, \\Rkj\\<R- 

We then have the following properties: 

P P 

e k < jz r^, 9k < jz rz~, 9k9k < c+c+, e k e k < c + c + . (13) 

(1 - c+)c_ (1 - c + )c_ 

For notational simplicity, we define the following quantities: 

a = max|e fe - g k \, a = max \e k - g k \. 

k k 

We will show in the sequel that, almost surely, a — > and a — > as N — > oo. 
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Consider first the following difference: 



sup 



trR fej (T - T) 



sup 

k,3 



sup 

k,3 



1 

iV' 



: trRjtjT 



N 



K N, 

EE; 



e z R, 



giR, 



l in 



1=1 m- 



< x 1 + e z ^trR m T 1 + g z ^trR Im T 



K N, 



ei-gi + em (^trR m T - ifrR im T) 1 



N 



\ - \ - — yi t 



1=1 m=l 



< —^K maxcfc 
p 2 k 



+ e,itrR 4m T) (l + g,itrRj m T) JV 

1 



trRfcjTR; m T 



max |e fc - g k \ + max |e fc 5fc| sup 

k k fcj 



N 



trR fcj (T - T) 



P 2 



a + 



P 2 



sup 



(1 - C+)V fej 



trR fcj (T - T) 



where the first equality follows from Lemma [3] Rearranging the terms yields: 



sup 



N 



trR fcj (T - T) 



< 



P 2 Kc+ 



R 2 P 2 



a 



P (l-c+) 2 c 



RP 



fOT P > (l-c + )c_ - 

Consider now the term a = maxfc \e k — g k \ : 



a = max 

k 



N k l 



trRfcj (T — T) + (g k — e k ) jjtrtl k j jjtr~R k jT 



< c + sup 

kj 



1 



iV 



trRfcj (T-T 



1 + efc^trRfcjT) (l + 5 fe itrR fei T) 
R 2 



+ £N,k 



max|efc - g k \ + maxje^fcl 



P 



< 



P 2 Kc 2 



R?P 2 
' {l-c + ) 2 c 2 _ 

P 2 Kc\ 

R 2 P 2 



-a H K—a + max lejy k \ 

fr k 



+ 



c + R 2 
P 2 



a + max | e 7v,fc | 

k 



{l-c + ) 2 c 2 _ 

where the last inequality follows from ( [T4] ). Similarly, we have for a = maxfc \e k — g k \: 



a = max 

k 



e k e k ~ 9k9k +Pkj(9k ~ e k ) 



3=1 



< 



< 



1 ^\phmax k \e k - g k \ 

3=1 
p2 



(cfc - e fc efc + e k p k j)(c k - g k g k + gfcPfcj) 

maxfc [e fc |efc - g k }\+ max fc [fiffc I e fc - g k \] 



N 



(1 - c+) 2 c 2 



+ Pkj 



(1 - C+) 2 C 2 



1 + 



1 



Thus, for p > max { , ^j^jg 



a < 



(1 - c+)c. 
|, we have 
2P 2 



a + 



(1 

PRc + 



c+) 2 c 2 _ 



p(l-c + ) 2 c 2 



a. 



(1 - c+) 2 c 2 



1 + 



1 

(1 - c+)c- 



Q. 



(14) 



(15) 



(16) 
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Replacing (16) in (15 1 leads to 



a < 



P 2 Kc% c + R 2 

_ B 2 P 2 + „2 
(l-c+) 2 5^ 1 

For p sufficiently large, we therefore have 



2P 2 ( 1 

(i - c+ yc 2 _ \ l + (i - c+)c -_ ) a + m r tol 



> 0. 



a < 2 max | e^r fe | — 

A; ' N— >oo 

This implies by ( fT6| > that also a — a ' s ' > . Since gt, &hi9k are uniformly bounded on all closed subsets 
of and is almost surely uniformly bounded on all closed subsets of M. + , we have from Vitali's 
convergence theorem [36] that the almost sure convergence holds true for all p € M+. This terminates 
the proof for Ck < 1. 



B. Case: limsupcfc = 1 

It was shown in [19, Proof of Theorem 1], that the following refined inequalities hold for Cfc = 1: 

efc <P, gk< P. 



Using these inequalities instead of ( 13 1 in the proof of the case c& < 1, one can show that a and 



a 



as N ->■ oo. 



C. Convergence of the mutual information 

Consider now the first term of Vjv(p) in Theorem [2] Due to the convergence of — — > 0, we 
have from the continuous mapping theorem [37, Theorem 2.3] that 



^ log det fliV + ^ ejfcH fc H^ - ^ log det fliV + ^ 5fcH fc H£ 



AT-j-oo 



-> 



since ||X]fc=i( e fc ~~ c/fc )HfcH^ 1 1 0. Applying Corollary bho the second therm yields 



i log det ^Ijv + ^Xj& H * H "l " 



TV -s> 00 



-»■ 0. 



(17) 



Consider now In(p) and In{p) as defined in Theorems [2] and [4] It follows from (10 1, (111 and (17), that 



1n{p) ~ In(p) 0. 

Af-5>oo 



This implies also that 



(18) 
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To prove mean convergence (ii), consider S7, the probability space that engenders the sequences {Wi, . . . , 
Wx,Hi, . . . ,Hr-}. Then, on a subspace of of measure 1, we have by ( fT8] ): In(p) — Tn{p) — > 
as N — > oo. The results follows directly by integrating this expression over Q., using the dominated 
convergence theorem ll38l Theorem 16.4]. ■ 

Proof of Proposition U\ By the chain rule of differentiation, we have 



dI N (p) dI N (p) 



A 



dI N (p) dgi dI N (p) gi 



dpkj dp kj ^ [ dgi dpkj dgi dp kj _ 
Consider now the partial derivative: 

di N { P ) dv N { P ) , 1 „ , „ , r _ _ 1T ,_i .in i 



dgi dgi N N q - ^ + q - 

9V^v(p) , _ / (1 - (H)ci 1 ^ 1 
+ 9i\l - — - > 



dgi \ Cj - ftffi N ^ a- gigi + 5^ 

% 

where the last equality follows from 

q _ _ ly e. - ffift + ffiKj 1 g - gjgj 

N 4-1 Ci — gigi + giPij N -f-f Cj - pift 

, «i 2 1 n * 

= Ci~ {Ci ~ 9i9i)vJ2 >„.„.. + 9i M ~ . _ ^ I ~ . " " 



A AT 



-/V ^ Cj - gigi + ^pij 2V Ci - gigi + ' a - gigi 



'1 - Ci)a 1 A 1 



{a - 9i9i) 1 - — - -r- T 2, - — 



Ci - gigi N ~[ c i~ 9i9i + 9iPij 
and Ci > CiCi > g^i by definition. 
Similarly, we have 

di N (p) dv N { P ) I (1 - a)ci , 1 A 1 



ft 



— ^= 9i- 



Ci - gigi N ~^[ c i~ 9i9i + 9iPij 



It remains now to calculate the partial derivatives 9V g g ^ and 9V q^ ■ To this end, notice that 

1 ,1 A 1 A w trR fc? T 



1 = -trTT- 1 = p-trT + Y g k - V * 7 fc ' J 

1 K 
= P^trT + ^ 5fc5fc . 



fe=i 
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Replacing Y,k=i9k9k in @ by ^trTT" 1 - p^trT yields 



K N k 



Vn( P ) 



N 



log det 



( P T) - ItrTT- 1 + pltrT + ± £ £ log ( 1 + 9k^K k ,T) ■ 



k=l j=l 

Taking the derivative with respect to and denoting T' = |? leads to 



dV N (p) 



trT T + o trT -I \ \ N — I 



JV 



1 

iV 



K N k 



EE r 



ffifc-^trRfe ,T' 



i ^ 1 + 5^trR fej T + P iV trT + iV S E ! + ^itrR^T 



1 



1 l + ^trRyT 

^ ft 1 tj rp, 



fc=l j 



+ ft 



This implies that 9I q^ = 0. We similarly have 



0ft 







and hence 



9/jy(p) 
S9< 



0. Putting the last results together yields 

dI N (p) _ dI N (p) _ g k 



dpkj dp kj N (c fe - g k g k + fiffcPfcj) ' 
We can calculate the second derivative in a similar manner: 

d 2 I N {p) d 2 I N (p) 



(19) 



9l 



dp 2 k 



dpi 



< 



J kj "fkj N ( c k - 9k9k + 9kPkj) 

since g k > 0. Thus, Tn(p) is a concave function in p k j for all k,j. It is straightforward to verify that 
also In(p) is concave in all p k j. 

Consider now the Lagrangian functions related to the power constraints (I) and (II): 



£(A,Ai, . . .,X K ,p n , . . . , pkn k ) = < 



(20) 



(^) 



We have from ( 19 1 



dp k j 



3k At /j\ 

N(c k -g k g k +g k p kj ) n k \ > 



(21) 



k N(c k -g k g k +g k p kj ) n k 

Solving for the Karush-Kuhn- Tucker conditions [ 39 ] for both cases yields the desired result. 

Take now the optimal solutions P* = (P*, . . . , P^) and P* = (P^, . . . , P* K ) and consider the fol- 
lowing difference: 



I N (P*) - I N (P*) = [l N (P*) - I N (P*)] + [In(P*) - In(P*)] + [In(P*) - In(P*) 
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where we used Jjv(P*) and Ijv(P*) to denote In(p) an d In(p) evaluated for the matrices (P^, . . . , P^) 
and (Pj, . . . , P^-) , respectively. Assuming that max^ limsupjy||P^|| < oo, we have from Theorem [4] 



I N (P*) - I N (P*) 
7jv(P*) - Ijv(P*) 



a.s. 



JV-»oo 
a.s 



-»• 



> 0. 



Since Ijv(P*) - /jv( p *) > and Ijv(P*) - /jv(P*) < 0, we can conclude that 



-> 0. 



I N (P*) - I N (P*) - 

JV-»oo 

It remains now to show that the matrices P£ satisfy indeed max^ limsupjy||P^|| < oo. Consider therefore 
the following expression: 



1 



I. 



Pkj- 



EI N = E- logdet I Ijv + ^B N[kj] + ^H t w tj w;.H H 
which is clearly strictly concave in pkj for all k,j. The corresponding derivative with respect to p^j reads 

1 H 



9EI 



iV 



1 



1. 



E— tr Ijv + B 



>/V 



N V P 
E ^ W " HfcH ( l7V + ^ 



-H fc w fci wJjH!? 



-i 



Similar to (21 1, the derivative of the Lagrangian to the optimization problem (|5]) is given as 

-l 

H fc w fci - nk v . , 



^w".h£ Ijv + JBjv 



-i 



H fc w fcj n* 

Consider now constraint (I). At the optimal point, we need to have = 0, and therefore 



E^wftHj; (ijv + ^Bjv^) H fc w fci 



dpkj 

Afc 



Since the right-hand side is independent of j, it follows that P^, — pi t l rik where pk is a parameter to 
be optimized. Since ^-trP£ = p k < Pj,, we have maxjf limsupjv||P^|| = Pk < P/b < oo. The same 
arguments hold for the sum-power constraint (//). ■ 



Proof of Theorem^ The proof follows directly from (lOi and ( |TT| ) applied to [191 Theorem 4]. 



Proof of Corollary [/} The proof of (i) follows directly from the continuous mapping theorem 11371 
Theorem 2.3]. Denote the probability space engendering the sequences {Wi, . . . , Wjf, Hi, . . . , H^-}. 
Then, on a sub-space of f2 of measure 1, we have by Theorem [4} Rn(p) — R~n(p) — > as N — > oo. 
Integrating this expression over Q using dominated convergence arguments proves (ii). ■ 
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Appendix B 
Related results 

Theorem 6 ( l[23\ Theorem 1]): Let Bjy = XX H , where X G i£ Nxn j s random. The jth column Xj 

i 

of X is given as Xj = R|u.j, where the entries of uj G C w are i.i.d. with zero mean, variance 
1/N and finite moment of order 4 + e, for some common e > 0, and Rj G i^NxN ar£ jj errri i t i an 
nonnegative definite matrices. Let D^r G C NxN be a deterministic Hermitian matrix. Assume that both 
Uj and Dtv have uniformly bounded spectral norms (with respect to N). Then, as n, N — > oo such that 
< liminf N/n < limsup N/n < oo, the following holds for any z G C \ M+: 

-J-trD^v (Bjy - ^Ijvr 1 - ^txD N T N (z) ^ > 

iv IV TV.n— »oo 



where TV(z) G C 7Vx7V is defined as 




and where 5i(z), . . . ,6 n (z) are given as the unique solution to the following set of implicit equations: 
such that (5i(z), . . . ,5 n (z)) G S n . For z < 0, 5\{z), . . . ,5N,n(z) are the unique nonnegative solutions 



to (22 1 and can be obtained by a standard fixed-point algorithm with initial values dj (z) = —1/z for 



j = 1, . . . ,n. Moreover, let Fn be the empirical spectral distribution (e.s.d.) of B^r and denote by Fn 
the distribution function with Stieltjes transform itrTjv(^). Then, almost surely, 

F N -F N => 0. 

Theorem 7 (fctOll): Under the assumptions of Theorem [6| let p > and define the quantity Vn(p) = 
^logdet (l N + JBtv). Then, 

EV N {p) -Vjv(p) >0 

N,n— >oo 



where 

JW) = i !o g de, L + lit i) + ± £>(! + « - 1 £ ^ 

\ H 3=1 3 / 3=1 3=1 J 

and where Sj = Sj(—p) for j = 1, . . . , n are given by Theorem [6] 
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Corollary 2: Under the assumptions of Theorem [7j assume additionally that the matrices Rj , j 
1, . . . ,n, are drawn from a finite set of Hermitian nonnegative-definite matrices. Then, 



V N (p) - V N (p) (23) 

N,n— >oo 

where Vn(p) and Vjv(p) are defined as in Theorem [7] 

Proof: It was shown in ll24l Proof of Theorem 3] that B^r has almost surely uniformly bounded 
spectral norm as N, n — > oo if the matrices are drawn from a finite set of matrices. Thus, Fn and Fjy 
as defined in Theorem [6] have (almost surely) bounded support. Consider now a probability space A C £1, 
f2 generating the matrices Bjv, for which B^r has bounded spectral norm, and another probability space 
B C J) for which F N - F N ^ 0. Since P(A) = P(P) = P(A n J3) = 1, it follows from [38, Theorem 
25.8 (ii)], that 

/ log(l + x\)dF N (\) - [ log(l + x\)dF N (X) a ' s - ) (24) 

J J iV,n->oo 



which is equivalent to stating that VW(x) — Vn(x) — — — > 0. ■ 

N,n—>oo 

Appendix C 
Useful Lemmas 

Lemma 3 (Resolvent identity): Let A and B be two invertible matrices. Then, 

A" 1 - B 1 = A" 1 (B - A) B _1 . (25) 

Lemma 4 (Matrix inversion lemma SB Eq. (2.2)]): Let A G c NxN be Hermitian invertible. Then, 
for any vector x G and any scalar t£C such that A + rxx H is invertible, 

x"( A + t xx") = 1 + tx „ a - 1x - 

Lemma 5 (Rank-1 perturbation lemma MB): Let z < 0, A € C iYxAr , B G c WxAr with B Hermitian 
nonnegative definite, and v G C . Then, 



tr f (B - zIn)- 1 - (B + vv H - zljv)" 1 ) A 



IIAII 

|z| 

Lemma 6 (Trace lemma / l?2] Lemma 2.7]): Let Ai, A2, . . • , with A^ G C 7VxAr , be a sequence of 
matrices with uniformly bounded spectral norm and let x^r =G C w be random vectors of i.i.d. entries 
with zero mean, variance l/N and eighth order moment of order 0(1/N A ), independent of Ajy. Then, 

x%A N x N - ^trAiv 0. (28) 

IV N—>-oo 
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Lemma 7: B3l Lemma 1] Denote ajy, a at, bjy and T>n four infinite sequences of complex random 
variables indexed by N and assume a at x a at and bjy x b^. If \a^ |, |6jv | and/or |ajv|,|fe/v| are uniformly 
bounded above over N (almost surely), then a^b^ x a^bx- Similarly, if |ajv|, | I 1 and/or |aAr|,|67v| _1 
are uniformly bounded above over N (almost surely), then /&/v ~ /i>N- 

Lemma 8 (Trace inequality): Let A,B,R 6 C NxN , where A and B are nonnegative-definite, satis- 
fying B >- A, and R is nonnegative-definite. Then 

trR (A" 1 - B" 1 ) > 0. (29) 

Proof: Note that B >- A implies by fl44j Corollary 7.7.4] B 1 ~< A" 1 . Thus, for any vector x € C^, 

x H (A" 1 - B" 1 ) x > 0. (30) 

Consider now the eigenvalue decomposition of the matrix R = UAU H , where U = [ui , . . . , Ujv] and 
A = diag(Ai, . . . , Xn)- Since Aj > Vz, we have 

N 

trR (A" 1 - B" 1 ) = X i u i ( A_1 - B_1 ) u * > °- ( 31 ) 

i=i 

■ 
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